Efficient quantization of speech excitation parameters using temporal decomposition
نویسندگان
چکیده
In this paper, we investigate the application of temporal decomposition (TD) technique to describe the temporal patterns of speech excitation parameter contours, i.e. gain, pitch, and voicing. We use a common set of event functions to describe the temporal structure of both spectral and excitation parameters, and then quantize them. Experimental results show that each speech excitation parameter contour can be well described by a set of excitation targets using the event functions obtained from TD analysis of line spectral frequency (LSF) parameters, with considerably low reconstruction error. Moreover, we can efficiently quantize the excitation targets by a combination of two uniform quantizers, one working directly on logarithmic excitation targets and the other working on the difference between current and previous logarithmic excitation targets.
منابع مشابه
Coding Speech at Very Low R and Temporal Deco
This paper presents a new method for speech coding at rates around 1.2 kbps based on STRAIGHT, a high quality speech analysis-synthesis method. For encoding spectral information, Modified Restricted Temporal Decomposition (MRTD) based vector quantization is used, where MRTD is a method of temporal decomposition for line spectral frequency parameters. Meanwhile, pitch and gain parameters are cod...
متن کاملVery low rate speech coding using temporal decomposition and waveform interpolation
In very low rate coding the aim is to accurately represent speech characteristics as efficiently as possible. High coding gains for the spectral features can be achieved through the use of temporal decomposition. Waveform interpolation coders accurately represent the excitation using characteristic waveforms (CWs) extracted at a constant rate. In this paper, the two approaches are combined into...
متن کاملEfficient quantization of LSF parameters based on temporal decomposition
In this paper, we present a restricted temporal decomposition method for LSF parameters. The event vectors estimated by this method preserve the ordering property of LSF parameters so that they can be quantized efficiently. Experimental results show that interpolated LSF parameters can be quantized transparently at the rate of 753 bps. We also design a LPC vocoder at 996 bps as an application o...
متن کاملA Glottal Vocoder Employing Vector Quantization
This paper describes a speech coder for low bit rates using a parametric representation of voiced excitation waveforms (Glottal ARX) and standard LPC for unvoiced. For efficient compression purposes the excitation and spectrum parameters are quantized with vector quantization (VQ). This has resulted in a glottal vocoder operating at 1320 bits/s and sounding more natural than a standard LPC voco...
متن کاملSpeech synthesis by structured segments, using temporal decomposition and a glottal excitation
Classical speech synthesis systems either concatenate diphone-like tabulated pattems or reconstmct speech parameters according to pre-defmed mles. Both techniques show drawbacks : the fonner lacks flexibility while the lauer is highly time-consuming_ to built. We propose an intennediate technique using structured segments : segmental units are still resorted to, but they are automatically analy...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003